Robust Multi-Class Gaussian Process Classification

نویسندگان

  • Daniel Hernández-Lobato
  • José Miguel Hernández-Lobato
  • Pierre Dupont
چکیده

Multi-class Gaussian Process Classifiers (MGPCs) are often affected by overfitting problems when labeling errors occur far from the decision boundaries. To prevent this, we investigate a robust MGPC (RMGPC) which considers labeling errors independently of their distance to the decision boundaries. Expectation propagation is used for approximate inference. Experiments with several datasets in which noise is injected in the labels illustrate the benefits of RMGPC. This method performs better than other Gaussian process alternatives based on considering latent Gaussian noise or heavy-tailed processes. When no noise is injected in the labels, RMGPC still performs equal or better than the other methods. Finally, we show how RMGPC can be used for successfully identifying data instances which are difficult to classify correctly in practice.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-class Classification with Dependent Gaussian Processes

We present a novel multi-output Gaussian process model for multi-class classification. We build on the formulation of Gaussian processes via convolution of white Gaussian noise processes with a parameterized kernel and present a new class of multi-output covariance functions. The latter allow for greater flexibility in modelling relationships between outputs while being parsimonious with regard...

متن کامل

Negative Selection Based Data Classification with Flexible Boundaries

One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...

متن کامل

Second Order Cone Programming Formulations for Handling Data with Perturbation

Ordinal regression problem and general multi-class classification problem are important and on-going research subject in machine learning. Support vector ordinal regression machine (SVORM) is an effective method for ordinal regression problem and has been used to deal with general multi-class classification problem. Up to now it is always assumed implicitly that the training data are known exac...

متن کامل

Outlier Robust Gaussian Process Classification

Gaussian process classifiers (GPCs) are a fully statistical model for kernel classification. We present a form of GPC which is robust to labeling errors in the data set. This model allows label noise not only near the class boundaries, but also far from the class boundaries which can result from mistakes in labelling or gross errors in measuring the input features. We derive an outlier robust a...

متن کامل

Spectral Feature Selection for Automated Rock Recognition using Gaussian Process Classification

A spectral feature selection scheme is proposed for multi-class automated rock recognition from real world drilling data using Gaussian Process classification. This work is part of a larger project aimed at surface mine automation. The motivation for this research is to investigate which combination of drilling data measurements is most relevant for rock recognition. We conduct feature selectio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011